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Fig. 7. Prwo and Qmo of a rabbit when the baickward wave reuims in late 
systole (indicaied by arrow). P,„o and Qmo have similar waveshapes before 
the uppearance of the backwaid wave. 
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summation of odd order reflected waves before they are completely 
damped, while the forward wave is the summation of the incident 
wave and even order reflected waves. It is also worth mentioning 
that most of the net flow (Qmo) occurred in systole and early 
diastole* while during late diastole the forward wave almost equaled 
the backward wave, and Qmu decreased to near zero, although theie 
was still a non-zero Pmo (Fig. 3). Physiologically, whoi the heart 
contracts periodically, the slow decay of i mo maintains the blood 
pressure in the arterial system even though the blood flow can be 
very low during diastole. 

The assumption Of linearity in this study was confirmed by the 
observation that in some rabbits, when the reflected wave returned 
to the measured site very late, the difference between the systolic 
waveforms of Pmo and Q«o was small (Fig. 7). The assumption 
of time-invariance was supported by the fact thai the pressure and 
flow waveforms of the beat preceding the asystole did not change 
if the R-R interval was constant (Fig. 1). So that, according to our 
definitions, it is appropriate to calculate Zo and RI from the initial 
Pmo. Qmo. Pjo and Pw). 

In this study, we have proposed a new mediod to analyze the 
mechanical characteristics of the carotid artery. This analytic method 
may be applied to other arterial subsystems, or to the entire arterial 
system. The method yields more accurate derivati<ms of forward and 
backward waves, and will be useful in further understanding Che 
mechanical propeittes of the arterial system and ventricular/arterial 
imeraction. 
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Ahstraet—Tfit effects of noise aulocorrdatioa on neural waveform 
recognition (detection, da.sslficath)n, and superposition resolution) are 
Investigated In this study using rokroelcctrode recording? Tnm the oortea 
of a monkey. Optimal waveform recognition is accompiisbed by passing 
the data tbrtiugh a whitening filter before matched filtering for detection 
or template matching for classification and superposilkm resolution. 
Template matching without whitening requires about 40% higher signal- 
to-noise ratio than template matching with whitening for comparable 
classification and superpiosItloD rooltttkm. The cumparabie dilTerence for 
detedton is 15%. 

I. Introduction 
Reliable recordings of the concuircnt activity of multiple neu- 
rons have become essential for the study of neural system.?. Such 
recordings are obtained with multiple extracellular electrodes, each 
providing waveforms from one or more neurons. The study presented 
here is part of an effort to develop a completely automated, optimal 
system for the recognition of action potential waveforms recorded 
with high impedance microelcctrodes in the mammalian central 
nervous system. The system is based on template matching using 
the Euclidean distance and an automatically determined acceptance 
threshold. The aim of this .«itudy is to determine the effect of noise 
autocorrelation on template matching and to present an optimal 
approach for recognition of action potential waveforms in colored 
noise. The contributions of this approach are reported for detwt- 
ing and classi^ng individual waveforms as well as for resolving 
superpositions of two waveforms. 

The most widely used technique for detecting action potentials is 
amplitude threshold crossing. Methods such as principal component.<; 
L2], and Haar uransform [15] have also been used. For optimal signal 
detection, wavefbims should be subject to Baycsian discrimination 
that separates them from background noise. 

Clas.sificatiun methods, ranging from amplitude discrimination to 
principal compottcnts and minimum mean-square-error, have been 
suggested in the last three decades [IH20]. In a comprehensive 
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comparison of separation methods, Wheeler and Heetdcrks in 1982 
evaluated the performance of nine different methods* including spike 
amplitude, conduction latency, principal components, and template 
matching using Euclidean distance [7]; the last two methods were 
found to be the best for spike sorting in noisy data. Because template 
matching paired with data whitening yields optimal perfomiance 
and because current technology allows its online implementation, 
we focused on neural spike sorting by template matching. Optimal 
and suboptimal (without whitening) template matching methods are 
compared using action potential waveforms and neural noise recorded 
from the cortex of an alert monkey. 

The simultaneous occurrence of action potentials of two or more 
neurons causes the waveforms of the individual neurons to be 
superimposed in extracellular recordings. The probability of three 
or more action potentials occurring simultaneously is negligibly 
low in most recwdings. However, two action potentials may be 
superimposed with considerable frequency depending on the number 
of neurons in the recording, their firing rates, the duration of the action 
potentials, and the timing relations between the action potentials 
ill, [11]. Failure to recognize the overiapping spikes can cause 
underestimation of the firing rales. Solutions for the superposition 
resolution problem have been suggested using template matching with 
the Euclidean distance [1] and the city block distance [19] as well as 
maximum likelihood methods [18], In this study, we evaluate the gain 
in superposition resolution performance associated with whitening, 
which together with template matching Ls theoretically optimal. 

n. Methods 

A. Data 

Recordings were obtained from the somatosensory cortex of an 
alert monkey with a filter pass-band of 10 Hz to 10 KHz, a 12-bii A/D 
converter and a sampling rate of 32 kHz. The standard deviation of 
amplitude quantization noise was less than 1 ^V. The electrodes were 
quartz glass fibers with tungsten-platinum alloy cores and 3-4 MQ 
impedance at 1 kHz.. In a recording with relatively low noi.se levels, 
five different types of action potential waveforms were isolated by 
visual inspection, and waveforms of the same type were averaged to 
form five templates, each represemed by 32 samples (1 ms). These 
templates, shown in Fig. 1(a). consiilule the spike test set used in 
this study. Recorded segments that contained no visually detectable 
action potentials were used as neural noise segments in this study. The 
neural noise was stationary and had a normally distributed amplitude 
with zero mean and a standard deviation of 39.7 /iV. Throughout 
the paper, these data (waveforms and noise) are referred to as raw 
data to distinguish them from the whitened data obtained by pai>!»ing 
the waveforms and the noise through a whitening filter. The signal- 
to-noise ratio (SNR) was based on raw, not whitened, data and was 
defined as the rms value of a waveform measured in a window of 1 
ras divided by the standard deviation of noise (39.7 /x V), The range of 
SNR*s in this study was obtained by scaling the relative amplitudes 
of templates with respect to noise and by adding scaled templates 
and noise. 

R. Whitening 

When the noise involved in the classification is stationary, normally 
distributed, and colored, optimal Bayesian classification is achieved 
by using the Mahalanobis distance [21 J, which is based on the 
invert of the noise covariance matrix. The Mahalanobis distance 
is not suitable for online applications due to the laige number of 
computations that it requires. When the noise is white, ix.. without 
temporal correlation, the noise covariance matrix is diagonal, and the 
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Fig. t. Spike templates and their noisy versions illustrate some of the noise 
conditions under which simulations were conducted, (a) Neural wavefmm 
templates. Each wavcfonn is obtained by averaging several spike waveforms 
of the same type taken from the recording. These five wavcfonns consfinite 
the spike test set. (b) Templates added to neural noise (a scaled pan of segment 
in Fig. 2(a) in which the MSEP is 10. Each template is located at die same 
position as in (a). The SNR value for each noisy template is shown below 
the corresponding wavefomi- (c> Templates added to neural noise with higher 
variance in which the MSRP is 6. 

Mahalanobis distance is reduced to the Euclidean distance. Therefore, 
when noise is colored, the simpler Euclidean distance metric can 
provide (^mmal classification if the data are first filtered with a 
whitening filter [22]. 

We computed the autocorrelation function of neural noise using 
a 16 ms noise segment from the recording mentioned above and 
observed a significant amount of conelation (see Section m). Similar 
autocorrelation was observed in several other such segments. The 
whitening filter was based on an autoiegressive moving average 
(ARMA) model of the neural noise: 

iV(fc) = Y^aiNik - 0 + - J) 

where N[k) is the iccorded neural noise. H{k) \s white noise, ai and 
lij are corresponding coefficients, and u and v are the model orders 
of the autoregressive and moving average components, respectively. 
The coefficients ai and 6> were computed to minimize the mean- 
square prediction error of the ARMA model 1231. The inverse of this 
model provided a whitening filter for the neural noise in this study. 

C. Detection 

Three techniques for detecting the occurrence of waveforms were 
studied. 
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1. Amplitude detection where the decision variable was ihc sam- 
pled voltage. 

2. Power detection where the decision variable was the sum of 
squared samples over a window of 32 samples. 

3. Matched filter detecUon where the decision variable for each 
waveform type was the inner product of the conesponding 
template with 32 samples of data. 

In each technique, the detection threshold was set at the least value 
that produced no false positive errors. 

D. Classification 

A waveform was considered to match a given template when its 
distance from that template was less than a predctcnnined acceptance 
threshold. When a waveform matched more than one template, it was 
assigned to the closest template. 

When the noise is white, the signal-plus-noise distribution for each 
waveform is hyperspherical. and optimal Bayesian classification can 
be achieved using Euclidean distance between an unknown waveform 
and known templates. In some panem recognition applications, the 
city block distance is used for its fast implenricmation. In this study, 
the classification peri'ormance of the squared Euclidean distance 
(SED) and city block distance (CBD) were compared using data with 
and without whitening. 

The SED is given by: 



k-i 

where S{k) is the data vector, T(k) is a template, and n is the number 
of sample points representing the wavefonn (32 in this study). For 
both raw data and whitened data, the acceptance thresholds were 
set at the lowest possible level that produced no exclusion errors. 
The whitened noise power had a chi-square distribution, and the 
acceptance threshold for whitened data corresponded to the 99.8% 
level of the cumulative chi-square distribution. 
The CBD is given by: 



withrf* = lS(fc)-T(fc)|. 

An acceptance threshold for the CBD has been suggested in [61. based 
on the assumption that has a normal distribution because it is the 
sum of a large number of djt's. However, the central Umit theorem 
requires the components of the summation to be independent. We 
computed the distribution of the random variable: 

fc=i 

for the colored and whitened noise, using 32 for n. In the colored 
noise, d» distribution irf dc« was skewed to tficleft and was signif- 
icantly differeni from a normal distribution. In the whitened noise, 
the distribution of dc« was close to a normal distribution, especially 
around the mean. However, the upper tail of this distribution was 
more extended than die normal, and setting die threshold with die 
assumption of normality resulted in about 5% exclusion eirors due 
to threshold, at all SNR IcveU. Therefore, we set die acceptance 
dircshold of the CBD at the lowest possible level diat resulted in no 
exclusion errors, in the colored and whitened noise tests. 

£. Superposition Resolution 

When a waveform is detected but fails to match one of die known 
templaies. Uic possibility that die wavefonn is die supciposition of 



two known types embedded in noise has to be tested. The resolution 
algorithm dial used compares die putative superposition wavefonn 
to aU possible combinations of each pair of templates wiUi aU possible 
delays in die range of -1 to +1 ms between die two templates. 
The window of diis comparison is twice as wide as the individual 
waveform window; in dus study, superpositions are resolved using 
64 samples for the waveform and die tested combinations. For each 
tested combination, a set of SED values is obtained by aligning die 
Stan of the combination widi several samples suirounding die onset of 
the unknown wavcfoim. When die minimal SED obtained among all 
SED values from all combinations is below die acceptance threshold, 
then the pair of templates oorrespcmding to dial combination is 
considered to indicate the constituent waveform types. 

As in classification, die acceptance thresholds for both raw and 
whitened data were set at die smaltest values diat caused no exclusion 
errors. 

IIL Results 

A. Whitening 

The neural noise and its autocorrelation function are shown in Fig. 
2(a) and (b). The dotted lines in Fig. 2(b) represent die diree standard 
deviation boundaries for ideal white noise. A considerable autocorre- 
lation is evident between lags of 2 and 10 samples. This significant 
amount of color in neural noise suggests diat die use of EucUdean 
distance widi raw data would be suboptimal and diat whitening is a 
prerequisite for optima! Euclidean distance classification. 

We tested various combinations for die ARMA model orders it and 
v in die range of 2 to 10 and evaluated die resulting inverse filter as 
a whitening filter for neural noise. Wc obtained die best results widi 
« = 4 and V = 3. This whitening filter is described by 

H{k) = a/bo) 

L .=1 j=i 

where 

ai = 0.046 ho = \ 

aa= 0.106 . 6i = 1.633 

a3 = -0.387 6. = 1.100 

04=0.167 &3 = 0.335. 

The recursive whitening filter designed in diis manner is stable vndi 
poles inside die unit chrclc in die complex plane. The neural noise 
segment of Fig. 2(a) was filtered with die whitening filter (initial 
conditions set to 0) and produced die whitened noise shown in Fig. 
2(c), whose autocorrelation function (Fig. 2(d)) remains in die range 
expected from white noise. 

B. Deteaion 

We compared die effects of whitening on die detection algoridims 
where signal amplitude, signal power, and output of a matched filter 
were used. The raw test data comprised the five physiological neural 
spike waveforms iUustrated in Rg. I (spike test set) and vanous 
levels of additive physiological (colored) noise. Instantaneous power 
was computed over a I ms period. Rg. 3 shows die detection results 
obtained widi die three techniques as a function of SNR. 

When matched filtering, power, and amplitude detection techniques 
were applied to raw (colored) test data. 95% correct detection was 
obtained above SNR levels of 1.5, 2. and 3. respectively. Whitenmg 
improved die perfomance of matched filter detection by approxi- 
mately 15%. Neither amplitude nor power detection were improved 
by whitening. For equal performance, matched filtering, power, and 
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Fig. 2. (a) A 16 ms Rcgmcnt of neural noise digitized ai 32 KHz. (b) The 
auuxxwrelalion function of noise in (a). The dotted lines represent the three 
standard deviation bounds expected from while noise, (c) While noise obtained 
by fihering the nuise in (a) with the whitening filter, (d) The autocorrelation 
function of white noise in (c). 



amplitude detection based on raw data required approximately 15%, 
50%, and 120% more SNR than did matched filtering detection 
based on whitened data. These ratias of SNR*s required for equal 
perfoimance were similar at all performance levels. 

C. Classification 

Classification performance depends on the similarity between dif- 
ferent spike tests as well as the level of noise in the record. 
Separability, the Euclidean distance between two spikes measured in 
units of standard deviation of noise, has been suggested as a parameter 
that reflects the ease of discriminating two spike waveforms (7J. 
The minimum separability (MSEP) in a group of spike waveforms 
is defined as the minimal Euclidean distance between waveforms 
measured in units of the standard deviation of noise [7] and serves 
as a mieasure of classification difficulty. The correspondence between 
MSEP and SNK at two levels of noise is shown in Fig. 1(b) and (c). 
In this study, the MSEP value was measured on the raw data. 

When classification tests with SED and CBD were applied to 
the raw data, 95% correct classification performance was obtained 
at MSEP values of 7.."? and 8.0, respectively (Fig. 4). Whitening 
improved the perfomiance of both algorithms, yielding 95% coirecl 
classification performance at MSEP values of 5.2 for SED and 6.0 
for CBD. Approximately 40% more MSEP (or SNR) wa<5 required 




Fig. 3. Detection lest icsults as a fimctioo of SNR defined as die ratio of the 
signal nns value (computed with 32 samples) to the standaid deviation of noise 
(computed on the complete noise record of Fig. 2(a)). Dotted curve, amplitude 
detection on raw data; dasih-dotted curve, power detection on raw data; dashed 
curve, matched ftlteiing detection on raw data; solid curve, matched filtering 
detection on whitened data. The pcifonnance of each algorithm was assessed 
whh test data containing 100 oocunaices of each type, it. 500 total ^ke 
occunences at each SNR level in the range of 0.4 to 3 with steps of 0.2. 
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Fig. 4. Classification test results as a fiiiKtion of MSEP. with and without 
whitening. Dotted and daithed curves show CBD and SED classification 
respectively, on raw data; dash-dotted and solid curves show CBD and SED 
classification, respectively, oo whitened data. Each curve was obUiined by 
classi^ng 100 wavefonns of each type, i.e., 500 wavefomis at each MSEP 
level in the range of 1 lo 16 with steps of 0.2. The MSEP was computed 
on the raw data. 

for SED classification on raw data to obtain the same performance 
as the optimal approach provided by SED on whitened data. CBD 
required 10-15% more SNR than SED for equal performance on 
both raw and whitened data. These percentage differences at 95% 
correct performance were similar fbr other performance levels, e.g. 
perfect classification pcrformanoe with SED was obtained with MS^ 
levels of 9 and 13 on whitened and raw data. Classification based 
on amplitude di.%rimtnation of the raw data (not shown in Hg. 4) 
was considerably worse, requiring SNR values 400% laiger than the 
optimal method for equal perfoniuuice. 

D. Superposition Resolution 

Superposition wavefonns were simulated by overlapping two tem- 
plates and embedding the resulting summation waveform into noise 
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SUPERPOSITION RESOLUTION 




Fin 5 SupctpositioD resolution pcrforaiancc as a ftinctioo of of *c 
aSll«t ^, to inAe icst set m pcrf omuuicc in ihe raw data is *o wn 
STdShcd line curve, and that of the whitened data is represented by *e 
SidlS^ cl:;ie. Ead; curve was obtained by ««clving 100 ^^^^ 
wavefoims at each SNR level m the range of OJi to 3 with steps of 0.23. 
iSch ^S^itioD was built with randomly selected pairs of tcmphttes with 
random delay between 0 and 28 samples. 

frum the recording. Each supcrposiuon wavcfonn was built by 
selecting the two constituent templates randomly from the spike test 
set and placing them with a random delay between 0 and 28 samples. 
Resolution tests were run on raw and whitened data using varying 
levels of additive noise. 

A decomposition was considered successful when the algontton 
identified ttie two constitucm spike types correctly. The resolution 
algorithm required an SNR of about 1 .6 for 90% conrect perfonnar^ 
on the raw data. When whitened data were used, the ^e p«for- 
mance was obtained at an SNR level of 1^. Below an SNR ofl .25, 
whitemng increased the peiccnl coirect resolution by about 20%. 
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as time quantization and sudden amplitude drop, can be accounted for 
by an appropriate modification in acceptance thresholds. Changes m 
ampUtudeTd nature of noise can affect the threshold esnmaies and 
the whitening process; these errors can be prevented by reguUrbj 
monitoring the autocorrelation of noise and updatmg *e reUted 
parameters. Erroneous template estimates can occur when the rWHd 
has an excessively high level of noise or spikes are similar in shape 
(very low sepaiabiUty). In such cases, optimal class.hcation camtoi 
be expected due to the difficulty in generating accurate templates. 

The whitening filter enhances the high frequency components 
of noise and reduces the MSEP by about 10%^ The seemmg^y 
paradoxical improvement in classification under reduced MSEP can 
be explained by the transformation that the data undergo m multidi- 
menXnal space. Ousters of each spike type have a hyperellipsoidal 
disiributiuD due to the autocorrelation of neural noise. The ec- 
centricity of these multidimensional distributions is mamfcsted by 
the eigenvalue spread of the noise covariance matrix. In our raw 
neural noise data, these eigenvalues ranged from 2.4 /i\ to about 
16000 yielding a 80 : 1 ratio between the lengths of the longest 
and shortest axes. In such cases, even when the MSEP is moderately 
low (e.g 5 to 8) the use of SED without whitemng can cause 
numerous errors. Tlie whitening filter nansfotms the hypcrclhpsoidal 
distribution of each type into a hypersphcrical distribution that can be 
optimally contained within the hypersphcrical decision shell provided 
by the SED threshold. 
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IV. DBCUSSION 
Waveform recognition algorithms based on whitened neural record- 
ings performed significantly better dtan die same algorithms operating 
on the raw data. Without whitening, classification and superposmon 
resolution algorithms required 40% greater SNR values to provide a 
pcif ormance equal to that obtained on whitened data. The comparabi e 
difference in SNR ftnr detection was 15%. 

The utility of superposition resolution was also apparent. Super- 
positions were resolved at the 95% conect level when the SNR 
was 1 75 Essentially perfect resolution was achieved at SNR of 3. 
TTie main source of error in superposition resolution is tiie similarity 
between different spike types. Furthennore, even when spike types 
are not similar, the superposition of two types may be vco^^milar to 
the superposition of two other types. At rclauvely low SNR levels, 
this can cause an additional performance degradation, whose scvwity 
depends on the superposition rate and the morphology of mdividual 

""Tn an'^'tomaied system, enor sources such as erroneous lemplaic 
estimates, time quantization errors, bascUne drifts, changes m amount 
or nature of noise, gradual changes in spike waveforms, and a sudden 
decrease in spike amplitude, e.g. when a neuron fires imme<hately 
after its previous depolarization, can degrade the classificauon perfor- 
mance. Some of these error sources can be eliminated; for exampl^ 
baseline drifts can be compensated by increasing the lower cutoff 
frequency of the analog band-pass filter, and waveform trends can be 
tracked witii adaptive template updating. Some sources of error, such 
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An Image Processing Scheme to Quantitatively 
Extract and Validate Hyoid Bone Motion Based on 
Real-Time Ultrasound Recordings of Swallowing 

Marc A. Cordaro and Bartsara C. Sonies 



Abstract — An Image processing technique with associated hardware 
was developed to (|uantltatlvely extract hyokl bone motion from real- 
time submandibular ultrasound images recorded daring the swallowing 
act Vidcofluorvgrmphk imaging, the "^old standaitT of swaUowing 
stadies, was recorded slmulUncously and synchronized to the ultrasound, 
Hyoid podlioa obtained firmn the uttrasoand was validated based on 
the videofluorosrapby using personal computer-based image processing 
methods. 



L INTRODUCTION 

In the clinical evaluation of swallowing physiology, several instni- 
mental diagnostic techniques are commonly used, including videoflu- 
orography (VP), ultrasound (US), nwnometry, and manofluorography. 
Of these, the most comnKinly used technique is videofluorogr^hy, 
which is often used exclusively and repeatedly with minimal consid- 
eration of the risks of ionizing radiation. Videofluorography provides 
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a dynamic, complete, pictorial, multiple plane visualization of the 
oropharynx during swallowing. Submandibular ultrasound provides 
a partial vi.sualization of the oropharynx, including most of the 
soft tissues such as the tongue and muscles of the mouth floor, 
without known harmful effects, but fails to directly visualize the 
bony structures of the mandible and hyoid bone. Uluiisound does 
allow dynamic visualization of movement of the tongue, transport 
of a bolus, and, indirectly, movement of the hyoid bone, and thus 
captures the oropharyngeal swallow [l]. 

One parameter of significant clinical interest is the temporospatial 
pattern of the excursion of the hyoid bone over the course of the 
oropharyngeal swallow. This pattern is closely related to the ability 
of the patient lo protect the airway during bolus transport into the 
esophagus, and is of primary clinical significance in the evaluation 
of dysphagia [2|. Quantitative analysis of hyoid bone dynamics has 
been performed on fluorographic image sequences [3J, but image 
piocessing has not been applied to the extraction of hyoid position 
from ultrasound images of the oropharynx. Such extraction would 
facilitate the transition of submandibular ultrasound from a subjective 
visual inspection methodology to a quantitative clinical research 
tool which may be safely used with children and normal subjects. 
Such use will allow the development of a quantitative database of 
swallowing parameters not previously possible with fluoroscopy due 
to the radiation risk. 

This p^r describes the application of image processing instni- 
menlation and methods to extraa the position of the hyoid hone 
from ultrasound image sequences. The technique is validated with 
a frame-by-friime comparison of hyoid bone position extracted from 
simultaneously recorded ultrasound and videofluorography over the 
course of a swallow. The complex motions and morphology of the 
swallowing mechanism make the application of a phantom study for 
calibration and validation prohibitive; however, a qualitative anatomic 
correlation has been performed using ultrasound images of a tongue 
excised fimm a human cadaver [4]. Therefore, a comparison of 
ultrasound to the "gold standard** (videofluorography) was chosen 
for this pilot study. 

II. Methodology 
The requirements of the system include the ability to record US 
and VF simultaneously and in synchrony onto video tape, digitize 
both modalities frame by frame, spatially remap frames to eliminate 
motion artifact, objectively extract hyoid bone posidon, and transform 
the VF coordinate system to perform quantitative comparisons of 
hyoid trajectory between modalities. 

A. Simultaneous Real-Time UltrasmindlVideofluorographic. 
Recording 

The real-time video signals from an Advanced Technology Labo- 
ratories (Bothell, WA) Ultramark 9 ultrasound scanner with a 5 MHz 
curved array (IVT) transducer and a standard videofluorography unit 
were synchronized in time using two For-A video timers (0.01 s 
resolution), which were slaved together with a common reset switch. 
The two nonsynchronous video signals were recorded imto separate 
Sony U-Maiic VO-5800 video cassette recorders (see Fig, 1.). This 
configuration allows bodi imaging modalities to be represented a.<i 
full resolution composite video frames widi the timer values super- 
imposed. The timer values were used to temporally align sequences 
of video frames from each medium during digitization and analysis 
of individual swallows. 
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